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Abstract 

We propose a new approach for analyzing price fluctuations in 
their strongly correlated regime ranging from minutes to months. This 
is done by employing a self-similarity assumption for the magnitude 
of coarse-grained price fluctuation or volatility. The existence of a 
Cramer function, the characteristic function for self-similarity, is con- 
firmed by analyzing real price data from a stock market. We also 
discuss the close interrelation among our approach, the scaling-of- 
moments method and the multifractal approach for price fluctuations. 



Scaling and self-similarity play an important role not only in physics but 
also in various socio-economic systems including market fluctuations, internet 
traffic, growth rates of firms and social networks. If phenomenological theory 
is shown to be applicable to observations, it would guide one to meaningful 
models and possibly to universal and specific description of such systems 
without any apparent "Hamiltonian" . 



Price fluctuations in speculative markets from physical viewpoint have 
recently received great attention |], 0, ||, f|. Advances have been aided by 
concepts and techniques derived from statistical physics and also by the new 
feasibility of analyzing high-frequency financial data. Considerable statisti- 
cal features have been uncovered that, interestingly, are quite independent of 
asset type (stock/foreign-exchange) and of markets (New York/Tokyo). In 
fact, we shall describe below that price fluctuation is spectrally white, but 
is not statistically independent since its magnitude exhibits a strong correla- 
tion in the form of a power law autocorrelation || ||, [7| (see also references 
in [0). Moreover, some have suggested that price fluctuations might have 
similar properties in different time-scales, that is partially supported by the 
multifractal analysis of price fluctuations [§, |TU| . In this paper we propose 
a new approach to characterizing self-similarity in the strongly correlated 
regime. We actually show the existence of self-similarity and characterize it 
both qualitatively and quantitatively from a new statistical physical view- 
point. Specifically, we directly evaluate the invariant scaling relation for 
fluctuations under coarse- graining of asset price returns. 

Time-scale of our interest is related to the temporal features of the price 
fluctuations known so far. Let P{s) be the asset price at time-step s, then 
the (logarithmic) return at s for duration t is defined by 

r t (s)=\og(P(s)/P(s-t)). (1) 

Return is equal to the fractional change of price after duration t when the 
change is small. We are interested in the fluctuation r t for different values 
of t. Each transaction is recorded for every stock in a particular market at 
the finest time-scale. For the dataset from the New York Stock Exchange 
(NYSE) used below, the typical time interval between successive transactions 
was about 10 seconds for stocks with large (trading) volumes. 

In a regime with time-scales of minutes and longer, considerable statistical 
features are known to exist |5|, ||, [7|]. For an appropriate small time-scale At 
(hereafter chosen as one minute), (i) r^t has an autocorrelation function 
(ACF) that almost vanishes beyond several minutes of time lag, whereas (ii) 
\r\ t \ has a long correlation ranging from several minutes to more than several 
months or even a year. In fact, the ACF for |rAt| has a power-law decay that 
is valid for such a long range of time lags. While (i) is interpretable as the 
absence of arbitrage using linear prediction, the long memory (ii), known as 
volatility clustering in economic literature, is not completely understood in 
its origin. 
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We assume from these observations that self-similarity property which 
will be formulated shortly holds for time-scales between t s and T, where t s 
is the small-, and T the large-scale cut-off. The value of t s is typically one 
minute or less, while T may range from several months to one year being 
regarded as a parameter. Using an appropriate minimal time-scale At, we 
focus on the magnitude of the coarse-grained return, 



yt{s) 



t/At-l 

r At (s-kAt) 

k=0 



\n(s)\. (2) 



The variable t is the coarse-graining time-scale. Fig. [I] exemplifies in an actual 
data set the additional property of (iii) self-similarity and intermittency for 
values of y t with different time-scales t. The main aim of this work is to 
propose a new approach for analyzing this self-similarity. 
First, we introduce the coarse-graining step n as 

t = t n = Te~ n (3) 

for n = 0, 1, . . . , N (N = log(T/£ s ) ^> 1), and y n = y t „. Next, we assume that 
the ratio y n+ i/y n is statistically independent of n, i.e., statistically steady for 
1 < w < This ansatz is the expression of the self-similarity assumption 
for price fluctuations. That is, by using the equation 

V -^ = e z \ (4) 

Vn 

z n is statistically steady with respect to n for the similarity region 1 Cn<C 
N. If we introduce the exponent z(t) as 

wt~r*w (5) 

z(t) corresponds to the local (Holder) exponent in multifractal theory. Except 
for the t-dependence of the magnitude y t , the similar temporal evolution of 
Ut for different values of t (Fig. |l[) is the origin of the self-similarity ansatz. 
It follows immediately from eq. (0J) that 

n-l 

y n = y T exp ^ z k = y T exp(ra^), (6) 

fc=0 

where = (1/n) Y^k=o z k- We assume that yx = y n =o has no fluctuation 
in the sense that one can neglect fluctuation if the coarse-graining scale is 
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beyond the time-scale T. Large deviation theory (e.g. ||ll|| ) in probability the- 
ory tells us that this self-similarity hypothesis, the statistical independence of 
exponents Zk in the coarse-graining step k, leads us to express the probability 
density function (PDF) Q n (z) of the average as 

Q n (z) oc e~ nS ^, (7) 

for 1 C n <C iV. Precisely speaking, eq. (|7|) is valid for n ^> n c , where n c is 
the correlation step of z n (e.g. defined as the decay step of the correlation 
function of (Sz n 5z ), where 5z n = z n — (z n )). Here the positive function S(z), 
being independent of n for n ^> n c , is known as a fluctuation spectrum or 
Cramer function, whose functional form characterizes the self-similar fluctua- 
tions and can be immediately determined by observing the price fluctuations. 
Under the variable transformation given by Eqs. and (|5|), 

z t (y)=\og(y/y T )/\og(T/t) (8) 

(denoting ~z^(y n ) as z t for short), the PDF Q n (z) is transformed to the PDF 
of y t , Pt(v), as 

p t{y) « - t • (Q) 



y \t 

As a dataset for investigating this self-similarity hypothesis, we used 
the Trade and Quote (TAQ) database of intraday transactions provided by 
NYSE, Inc. The prices of a particular stock recorded between Jan. 4, 1993 
and June 30, 1999 were used. The median of the time differences between 
quotes was 12 sec. The timing of the quotes was used to construct a time- 
series of returns r^ t for At =1 min, excluding midnights, i.e., the returns 
corresponding to pairs of closing price and next day's open. The total dataset 
is comprised of 638,901 records. Next we obtained an ensemble for y t with 
different time-scales t by using eq. (0). Other stocks with high volumes were 
analyzed and yielded similar results to those given below. 

Removing the intraday periodicity from the time-series is a vital but dif- 
ficult task. We used essentially the same method as that in || (see also [0])- 



Note that a daily period for the NYSE from open to close is approximately 
390 minutes. The observables adjusted with the intraday periodicity will be 
referred to as r t and y t . 

Let us now calculate how the self-similarity ansatz formulated above ac- 
tually holds in the range of f s C t C T. This can be done by directly 
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examining the scaling relation, eq. (||), rewritten as 

log[yP t (y)]/log(T/t) = S(z t (y)), (10) 

where we ignored an additional constant term which corresponds to the nor- 
malization of the PDF. The result is summarized in Fig. |2|. It shows that 
the curves for different coarse-graining time-scales t collapse. The value of 
T is taken to be 40960 min, t = 10,20,40, . . . and 1280 min, and yx is set 
to be the mean of the data for the cut-off time-scale T. The whole set of 
curves are shifted vertically so that the minimums are at the origin of the 
vertical axis (this corresponds to the additional constant ignored in eq. (|i~0|) ). 
This invariant, i.e. the t-independent scaling relation is the main result of 
the present work. It supports the self-similarity ansatz and yields important 
information about the functional form of S(z). Note that the minimum z m 
for S'(z m ) = is approximately —0.5 demonstrating the diffusive nature of 
(yt) ~ Vi in eq. (0). 

It is easily shown that if the Cramer function is a linear function, S(z) = 
a(z — zq), in a certain region of z, the PDF for y t takes the power law 
Pt{y) ~ y al m the corresponding region of y. Data analysis shows that 
the region dS(z)/dz < can be closely approximated by using a = — 1. This 
corresponds to the fact that Pt{y) has a constant value for small fluctua- 
tions in y. It should be noted that this region, and that for even smaller 
fluctuations, is subject to the fact that price value is actually discrete (not 
shown in Fig. |]), which is of limited interest. On the other hand, the region 
dS(z)/dz > describes large fluctuations, which is related to previous work 



on the scaling of moments reported in Refs. [§, |9], [L0| . That is, the moments 
(yl) of order q scales with t as (y1) oc t^ q \ Such scaling is related to the 
approach presented here. Actually, the relation between (f>(q) and S(z) can 
be found if we note that 

(y q t )^y q T (T/tr~ s ^dz } (11) 



by using eq. (|9]). If we assume that the scaling is of the form, (yl) oc t^ q \ and 
S"(z) > 0, evaluating eq. (|TTD by using the steepest-descent method yields 

4>(q) = mm[S(z) — qz\. (12) 

Thus <j)(q) and S(z) are related to each other by the Legendre transformation 
of eq. (p 
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Our self-similarity formulation originates from Kolmogorov's lognormal 



theory in the statistical theory of turbulence [13], and has been recently ap 



plied, from the statistical viewpoint of large deviation theory, to analyze 



the shell model of turbulence |L4j and the long-time correlation in on-off 
intermittency fllSl . Indeed the return r t for duration t corresponds to the 
velocity difference across a spatial distance arbitrarily chosen in the iner- 
tial subrange in turbulence, where the self-similar energy cascade process is 



observed fl3| , |TT|. In the study of on-off intermittency which is typically ob- 
served right after the breakdown of synchronization of coupled chaos [15) , the 
difference of dynamical variables of chaotic elements also shows self-similar 
characteristics. In connection with this phenomenon, r t corresponds to the 
difference of dynamical variables which exhibits on-off intermittency. In the 
latter analogy, we note that the corresponding variable possesses similar prop- 
erties as the above (i)-(iii) for the price fluctuations. The present analysis is 
useful for these systems which show self-similarity due to strongly correlated 
fluctuations either in space or in time. 

Fig. ^ compares S(z) with the result numerically obtained from <p(q) by 
the Legendre transformation in the region of dS(z)/dz > 0. Order q of 
the moment is increased from to 8, near and beyond which the scaling 
of moments, (yl) oc t^ q \ becomes unclear. This might be related to an 
ill definition of higher moments caused by the heavy-tail behavior of the 



PDF for y t [16|. Near the minimum of S(z) (z < —0.3), the curve obtained 
from (p(q) can be closely approximated by a quadratic polynomial H, and 
fits reasonably to the curve of S(z). However, it deviates significantly from 
S(z) for the region of larger fluctuations, where S"(z) approaches zero. This 
indicates a breakdown in the relation between <fi(q) and S(z), which is based 
on the concavity of S(z). Actually, if we approximate S(z) in the region 
—0.3 < z < —0.1 by using the linear relation, S(z) = a(z — z ), with a ~ 3, 
the result corresponds to the power- law Pt{y) ~ y~ 4 - This is compatible with 
the heavy-tail behavior found in [[UJ, although more detailed study is needed 
to clarify the functional form S(z) for this region and those of even larger 
fluctuations. 

The Cramer function S(z) is related to the multifractal spectrum inves- 
tigated in ||. Suppose the time-series with span T is divided into (T/t) 
intervals each of which has a time span t. By evaluating the exponent z(t), 
eq. (HD, in each interval, one obtains the number density N t (z) for the inter- 
vals such that z(t) is between z and z + dz. The fractal dimension f(z) of 
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the support for those intervals is defined by 



N t (z) ~ (f/D-'M. (13) 

Noting that Qt(z) ~ N t (z)/(T/t) results in the relation 

S(z) = l-f(z). (14) 

Thus the multifractal spectrum f(z) is essentially equivalent to the present 
Cramer function S(z). It should be mentioned, however, that the "dimen- 
sion" f(z) can take negative values because, in principle, S(z) can take any 
positive value. We also stress that the function S(z) can be directly calcu- 
lated from data without resorting to the scaling of moments and the Legendre 
transformation as is necessary to evaluate f(z). These facts may suggest that 
the concept of S(z) is more useful than f(z). 

In summary, we have found that self-similarity of volatility at different 
time-scales in the strongly correlated regime can be characterized by direct 
evaluation of the Cramer function. The scaling relation eq. ( |iH| ) is our main 
result, which was a consequence of the self- similarity and large deviation 
theory. We have also shown the relations of the Cramer function S(z) to the 
scaling-of-moments method and to the multifractal spectrum. The functional 
form of S(z) is compatible with the power-law ("cubic law") behavior at 
extremely large deviation where S(z) deviates from quadratic form. 

Finally we would like to remind the reader that price fluctuates inevitably 
near the critical point of the apparent equilibrium between demand and sup- 



ply [|17[ . The critical point is not observable to any market participants whose 
expectations and memory give rise to the interesting statistical features of 
price fluctuations. It is indeed intriguing that the critical fluctuations in 
market can be well characterized by methods in non-equilibrium statistical 
physics. While we have not reached an understanding of how market dynam- 
ics gives rise to the scaling and self-similarity found here, we believe that our 
finding and analysis will provide a good basis for phenomenology challenging 
any models used to understand market dynamics. 
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Figure 1: Time-series of price fluctuations magnitudes yt = |j"t(s)| (normal- 
ized by standard deviation of each after removing intraday periodicity) for 
time-scales t = 10, 20 and 40 min. Similar temporal evolutions are visible. 




Figure 2: Variable z vs. S(z) (Eqs. (||) and (PH)) for time-scales t = 
10, 20, 40, . . . and 1280 min, when T = 40960 min. Plots lie on similar curve, 
which represents functional form of Cramer function S(z). Note that S(z) 
has minimum at z ~ —0.5. 
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Figure 3: Details of Fig. |2] in region dS(z)/dz > for time-scales up to 
t = 320 min. Compare with the result (heavy line) obtained from scaling of 
moments (of order < q < 8) by Legendre transformation. Broken line is 
quadratic-polynomial fitting for solid line and an extrapolation (lognormal 
approximation). 



